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(57) Abstract: The invention provides a method and system for routing or switching File system requests in a mass storage system. 
The mass storage system includes multiple storage devices coupled using fiber channel to a routing or switching device disposed 
internally within each shelf of mass storage devices in the mass storage system, and coupled direcdy to each individual mass storage 
device in that shelf. The switching device receives requests, identifies an individual target storage device for each such request, 
and sends each such request to its designated target storage device. The switching device also receives responses to such requests 
from each individual storage device within the shelf, and sends those responses from the storage device to an external medium 
with which the response can be delivered to the original requester. One advantage of the switching device is that the switching 
device can maintain communication bandwidth between the mass storage devices and external devices approximately equal to the 
communication bandwidth of a single fiber channel times the number of point-to-point connections. 
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SWITCHING FILE SYSTEM REQUESTS IN A MASS STORAGE SYSTEM 

Background of the Invention 

5 L Field of the Invention 

This invention relates to routing or switching file system requests in a 
mass storage system, such as one having multiple storage devices. 

10 2. Related Art 

Mass storage systems can include multiple storage devices, such as disk 
drives. When a file server request arrives at the mass storage system, the request is 
identified as destined for a target one of the multiple storage devices, and is sent to 

1 5 that identified target one storage device for processing (and for a possible response). 
In a first set of known devices, the file server request is received by a hub device, 
which propagates the file server request to its target storage device, using a fiber 
channel arbitrated loop interface. Thus, the hub is coupled to each shelf of mass 
storage devices (where each shelf of storage devices includes an enclosure housing a 

20 plurality of storage devices) using a star configuration of fiber channel arbitrated 

loop; thus, the loop couples each shelf of mass storage devices to the hub and couples 
each shelf of mass storage devices back to the hub. In a second set of known devices, 
the file server request is received by a switching device, which identifies the target 
storage device within the mass storage system. The switching device then sends the 

25 file server request directly to the shelf of mass storage devices including the target 
storage device, using a fiber channel arbitrated loop interface (the loop including each 
of the mass storage devices in that shelf); the switching device receives responses 
directly from that shelf of mass storage devices and routes those responses back to the 
original requesting device making the file server request. 

30 
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One problem with known systems is that the fiber channel arbitrated 
loop is subject to error, including the possibility that the fiber channel has a break, 
disconnection, or other failure, and the additional possibility that one or more of the 
storage devices in the mass storage system will fail to forward file system requests to 
5 subsequent storage devices in the fiber channel arbitrated loop. In known devices 
including a hub, the hub monitors each connection to each shelf of mass storage 
devices (thus, paying attention to each of its ports, as well as to timing and signal 
requirements for signals on each connection). If the hub detects an error on a loop to 
or from a mass storage device in a shelf of storage devices, it bypasses the connection 
1 0 to that one shelf of storage devices, and effectively deletes the connectivity to all 
storage devices in that one shelf. 

Although these known systems generally achieve a result of isolating 
errors in the fiber channel arbitrated loop, they are subject to several drawbacks. In 

1 5 cases when the hub detects errors, file server requests can fail to be sent to the target 
storage device. Similarly, in cases when the hub detects errors, even if the file server 
request is properly sent to the target storage device, one or more responses from the 
target storage device can fail to be properly sent back to the original requesting 
device. In systems where a hub or switch is used to route file server requests among 

20 multiple shelves of mass storage devices, isolating one of the fiber channel arbitrated 
loops has the effect of isolating an entire shelf of mass storage devices; in these cases, 
advantages obtained from redundancy of mass storage devices in the shelf (such as in 
RAID systems) are generally lost. 

25 A first known method for attempting to remedy this problem is to 

provide redundant system elements, such as a plurality of fiber channel loops, a 
plurality of couplings between fiber channel loops and individual storage devices, or 
a plurality of elements at each other single point of failure. Although this first known 
method generally achieves the result of guarding against failure of the mass storage 

30 system, it is subject to several drawbacks. First, redundant system elements have 
additional cost, thus increasing the cost of the entire mass storage system. Second, 
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system design using redundant system elements is more complex than without using 
redundant system elements, again increasing the cost of the entire mass storage 
system. Third, fiber channel drives are subject to failure modes that can bring down 
multiple fiber channel loops at once (for example, a bad crystal in a fiber channel 
5 drive will cause a mismatch in frequency and bring down both fiber channel links to 
which the fiber channel drive is coupled). 

A second known method for attending to remedy this problem is to use 
serial channel techniques other than fiber channel arbitrated loops. One such 
10 alternative serial channel technique is SSA, which is less subject to the drawbacks 
noted of the known art. However, use of SSA is subject to other drawbacks — fiber 
channel arbitrated loop is a known standard, and thus makes it easier to design 
systems for use with that known standard. 

15 Accordingly, it would be advantageous to provide a technique for 

routing or switching file system requests in a mass storage system that is not subject 
to drawbacks of the known art. Such a technique would preferably include a 
capability for routing or switching file system requests in a mass storage system that 
is not subject to failure of either a fiber channel arbitrated loop or of any individual 

20 storage device. 

Summary of the Invention 

The invention provides a method and system for routing or switching 
25 requests in a mass storage system. In a preferred embodiment, the mass storage 
system includes multiple storage devices coupled using fiber channel to a routing or 
switching device disposed internally within each shelf of mass storage devices in the 
mass storage system, and coupled directly to each individual mass storage device in 
that shelf. The switching device receives requests, identifies an individual target 
30 storage device for each such request, and sends each such request to its designated 
target storage device. The switching device also receives responses to such requests 
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from each individual storage device within the shelf, and sends those responses from 
the storage device to an external medium with which the response can be delivered to 
the original requester. One advantage of the switching device is that the switching 
device can maintain communication bandwidth between the mass storage devices and 
5 external devices approximately equal to the communication bandwidth of a single 
fiber channel times the number of point-to-point connections. 

The invention provides an enabling technology for a wide variety of 
applications for routing or switching requests in a mass storage system, so as to 
10 obtain substantial advantages and capabilities that are novel and non-obvious in view 
of the known art. Examples described below primarily relate to mass storage systems 
including a plurality of storage devices coupled using fiber channel arbitrated loop, 
but the invention is broadly applicable to many different types of storage systems. 

15 Brief Description of the Drawings 

Figure 1 shows a block diagram of a system for routing or switching 
requests in a mass storage system. 

20 Figure 2 shows a data flow diagram in a system for routing or switching 

requests in a mass storage system. 

Figure 3 shows a process flow diagram of a method for operating a 
system for routing or switching requests in a mass storage system. 

25 

Detailed Description of the Preferred Embodiment 

In the following description, a preferred embodiment of the invention is 
described with regard to preferred process steps and data structures. Embodiments of 
30 the invention can be implemented using general purpose processors or special 
purpose processors operating under program control , or other circuits, adapted to 
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particular process steps and data structures described herein. Implementation of the 
process steps and data structures described herein would not require undue 
experimentation or further invention. 

5 Lexicography 



The following terms refer or relate to aspects of the invention as 
described below. The descriptions of general meanings of these terms are not 
intended to be limiting, only illustrative. 

• client and server — in general, these terms refer to relationships between a 
client and a server, not necessarily to particular physical devices 



• client device and server device - in general, these terms include any device 
15 taking on the role of a client or a server in a client-server relationship; there is 

no particular requirement that any client device or any server device must be 
an individual physical device (each can be either a single device, a set of 
cooperating devices, a portion of the device, or some combination thereof) 



20 • communication bandwidth — in general, a measure of the amount of 
information being sent per unit time between or among devices 



• disposed internally — in general, with regard to a mass storage system, a 
device disposed so as to be treated by external devices as part of that mass 

25 storage system 

• external medium — in general, a communication medium or a device 
external to the mass storage system and coupled to the mass storage system for 
inter-operation therewith 

30 
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fiber channel — in general, a communication medium including optical fiber 
for sending information, including without limitation either (a) point-to-point 
communication between pairs of devices, or (b) arbitrated communication 
among a sequence of devices configured in a loop 

hub or switching device — in general, a device for communicating messages 
among storage devices in a mass storage system and points external to the 
mass storage device; thus, a connectivity device for storage devices in a mass 
storage system 

request — in general, a message from a requesting device, to a mass storage 
system, such as indicating a request to perform a file system operation 

mass storage shelf (or enclosure) — in general, a device used to house one or 
more storage devices; in a preferred embodiment, each mass storage shelf 
includes at least a portion of the mass storage system 

mass storage system — in general, a device whose purpose is to provide 
storage and retrieval of information; in a preferred embodiment, the mass 
storage system includes one or more storage devices housed in one or more 
mass storage shelves (or enclosures) 

original requester — in general, the client device that made an original 
request, and to which one or more responses to that request are to be delivered 

point-to-point connection — in general, a communication link between a 
sending device and a receiving device 

responses to requests — in general, a message from a device (such as a mass 
storage system) to a requesting device, such as indicating a response to file 
system request 

6 
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• routing or switching — in general, forwarding messages (such as requests or 
responses to those requests) from a first device (such as a requesting device) to 
a second device (such as a mass storage device within a mass storage system) 

5 

• storage device — in general, a device for storing and retrieving information, 
such as for example a magnetic storage device, an optical or magneto-optical 
storage device, or one or more disk drives 

10 • switching device — in general, one or more devices for performing routing or 
switching 

As noted above, these descriptions of general meanings of these terms 
are not intended to be limiting, only illustrative. Other and further applications of the 
15 invention, including extensions of these terms and concepts, would be clear to those 
of ordinary skill in the art after perusing this application. These other and further 
applications are part of the scope and spirit of the invention, and would be clear to 
those of ordinary skill in the art, without further invention or undue experimentation. 

20 System Elements 

Figure 1 shows a block diagram of a system for routing or switching 
requests in a mass storage system. 

25 A mass storage system 100 includes a set of storage devices 1 10, a hub 

or switching device 1 20, a set of internal communication links 1 30, and one or more 
external communication links 140. The mass storage system 100 is disposed for 
coupling within a larger system, including at least one communication medium 150 
and at least one requesting device 160 (such as a file server client). 

30 
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In a preferred embodiment, the mass storage system 100 is disposed so 
as to include the entire set of storage devices 1 10, the hub or switching device 120, 
and the internal communication links 130, all within a single physical housing. 
However, there is no particular requirement that the mass storage system 100 be 
5 disposed within a single physical housing, and can instead include more than one 
physical housing. For example, a first plurality of storage devices 110 can be 
disposed within a first physical housing such as a first shelf, while a second plurality 
store devices 1 1 0 can be disposed within a second physical housing such as a second 
shelf. In this example, in a preferred embodiment, the first and second plurality of 
10 storage devices 1 10 would each have their own independent hub or switching device 
1 20 disposed within each shelf. 

In a preferred embodiment, each one storage device 1 10 comprises a 
mass storage device, such as a magnetic storage device, a magneto-optical or optical 
15 storage device, for example as embodied in a disk drive. However, there is no 

particular requirement that any storage device 1 10 be constructed or disposed using a 
particular technology. Moreover, there is no particular requirement that all storage 
devices 1 10 are constructed or disposed using the same or similar technologies. 

20 In a preferred embodiment, the hub or switching device 120 includes a 

device having the capability to receive messages and send those messages to 
individual ones of the storage devices 110. 

For a first example, where the hub or switching device 120 is a hub, the 
25 hub is coupled to each one of the storage devices 1 1 0 individually, so that the hub can 
send or receive a message from any one of the storage devices 1 10 individually 
without that message having to pass by a second one of the storage devices 110. One 
preferred configuration couples the storage devices 1 10 to the hub in a star 
configuration, so that the hub is coupled to each individual storage device 1 10 using a 
30 separate fiber channel arbitrated loop. 

8 
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For a second example, where the hub or switching device 120 is a 
switch, the switch is coupled to each one of the storage arises 1 10 individually, so 
that the switch can send or receive a message from any one of the storage devices 110 
individually. One preferred configuration couples the storage devices 1 10 to the 
5 switch in a crossbar configuration, so that the switch can couple any individual 
storage device 1 10 to any other individual storage device 110, and can couple any 
individual storage device 1 10 to an individual external communication link 140. 

In a preferred embodiment, the set of internal communication links 130 
10 include fiber channel, disposed in a star configuration, so that each storage device 
1 10 is coupled directly to the hub or switching device 120 using a pair of fiber 
channel links and forming a fiber channel loop. 

The hub or switching device 120 is coupled to the external 
15 communication links 140, so as to receive requests from the external communication 
links 140 and to send those requests to designated storage devices 1 10, and so as to 
receive responses to those requests from the storage devices 1 1 0 and to send those 
responses to the external communication links 140. 

20 System Operation 

Figure 2 shows a data flow diagram in a system for routing or switching 
requests in a mass storage system. 

25 A data flow diagram 200 includes a set of information flow paths 210 

and a set of messages 220 used for those information flow paths 210. 

A first information flow path 2 1 1 includes a request, such as possibly a 
file server request using NFS or a similar file server request protocol. A set of 
30 request messages 221 indicates the nature of the request. In a preferred embodiment, 
the request can comprise either a single request message 221 or a sequence of more 

9 
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than one request messages 22 1. as may be appropriate within the structure of a file 
server request protocol (or other request protocol) and with regard to the nature of the 
specific request. 

5 The requesting device 160 sends the request messages 221, using the 

communication medium 150, to the mass storage system 100, for distribution to the 
appropriate storage device 110 and response thereto. The mass storage system 100 
receive the request messages 221 and couples them to the hub or switching device 
120. The hub or switching device 120 receives the request messages 221 and 

10 determines which storage device 110 to which the request messages 221 should be 
sent. 

A second information flow path 212 includes the internal routing, 
within the mass storage system 100, of the first set of request messages 221. The hub 
15 or switching device 120 forwards the request messages 221 to the specific storage 
device 110 determined to be the appropriate target of the request messages 221. That 
storage device 1 10 receives the request messages 221, and proceeds to process them 
according to the file server request protocol (or other request protocol). 

20 A third information flow path 213 includes the internal routing, within 

the mass storage system 100, of a set of response messages 223, corresponding to a 
response to the original request indicated by the set of request messages 221. The 
target storage device 110 of the original request messages 221 responds to the 
original request; the response is indicated by the set of response messages 223. The 

25 target storage device 1 10 sends the response messages 223 to the switching device 
120, which receives the response passages 223. 

A fourth information flow path 214 includes the response to the original 
request. The set of response messages 223 indicates the nature of the response to the 
30 original request. In a preferred embodiment, the response can comprise either a 
single response message 223 or a sequence of more than one response messages 223, 

10 
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as may be appropriate within the structure of the file server request protocol (or other 
request protocol) and with regard to the nature of the specific request and the 
response thereto. 

The hub or switching device 120 sends the response messages 223, 
using the communication medium 150, to the original requesting device 160. 

Method of Operation 

Figure 3 shows a process flow diagram of a method for operating a 
system for routing or switching requests in a mass storage system. 

A method 300 includes a set of flow points and a set of steps. The mass 
storage system 100, in combination with and cooperation with the communication 
medium 150 and the requesting device 160, performs the method 300. Although the 
method 300 is described serially, the steps of the method 300 can be performed by 
separate elements in conjunction or in parallel, whether asynchronously, in a 
pipelined manner, or otherwise. There is no particular requirement that the method 
300 be performed in the same order in which this description lists the steps, except 
where so indicated. 

At a flow point 3 1 0, the requesting device 1 60 is ready to send a request 
to the mass storage system 1 00. 

At a step 3 1 1, the requesting device 160 prepares the request, including 
one or more request messages 221, according to the NFS file server protocol (or 
another appropriate protocol). 

At a step 312, the requesting device 1 60 sends the request messages 221 
to the mass storage system 100, using the communication medium 150. 
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At a step 3 13, the communication medium 150 communicates the 
request messages 221 from the requesting device 160 to the mass storage system 100. 

At a step 314, the mass storage system 100 receives the request 
5 messages 22 1 , at the hub or switching device 1 20. 

At a step 315, the hub or switching device 120 determines a target 
storage device 1 10 for the request messages 22 1 . 

10 At a step 3 16, the hub or switching device 120 sends the request 

messages 221 to the target storage device 110. 

At a step 3 1 7, the target storage device 1 1 0 receives the request 
messages 221. 

15 

At a step 3 18, the target storage device 1 10 processes the request 
messages 221, and formulates a response thereto. As part of this step, the target 
storage device 1 10 prepares the response to the request, including one or more 
response messages 223, according to the NFS file server protocol (or another 
20 appropriate protocol). 

At a step 319, the target storage device 110 sends the response 
messages 223 to the hub or switching device 120, with an indication that the response 
messages 223 should be forwarded to the requesting device 160 making the original 
25 request. 

At a step 320, the hub or switching device 120 receives the response 
messages 223 from the target storage device 1 1 0, and forwards them to the requesting 
device 160 making the original request, using the communication medium 150. After 
30 this step, the method 300 has received and handled the request. 

12 
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The method 300 can be performed one or more times starting from the 
flow point 310 and continuing therefrom. 

Generality of the Invention 

The invention has general applicability to various fields of use, not 
necessarily related to the services described above. For example, these fields of use 
can include one or more of, or some combination of, the following: 

• Mass storage systems including file servers and other devices in combination; 

• Mass storage systems including heterogeneous storage devices; and 

• Other types of storage systems. 

Other and further applications of the invention in its most general form, 
will be clear to those skilled in the art after perusal of this application, and are within 
the scope and spirit of the invention. 

Alternative Embodiments 

Although preferred embodiments are disclosed herein, many variations 
are possible which remain within the concept, scope, and spirit of the invention, and 
these variations would become clear to those skilled in the art after perusal of this 
application. 
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Claims 

1 . Apparatus including 

a housing, said housing enclosing a plurality of storage devices, a 
5 corresponding plurality of communication links each one of which is associated with 
one of said plurality of storage devices, and a hub or switching device coupled to said 
plurality of communication links; 

wherein said hub or switching device is disposed for receiving 
messages from a point external to said housing, and is disposed for forwarding said 
1 0 messages independently to an individual one of said storage devices. 

2. Apparatus as in claim 1, wherein a first one said storage device 
is capable of communicating with said hub or switching device independently of a 
communication link associated with a second said storage device. 

15 

3. Apparatus as in claim 1, wherein at least one of said 
communication links includes a fiber channel arbitrated loop. 

4. Apparatus as in claim 1 , wherein, for each pair of said storage 
20 devices, a first one of said pair is capable of communicating with said hub or 

switching device independently of a state of said communication link associated with 
a second one of said pair. 

5. Apparatus as in claim 1 , wherein said hub or switching device 
25 includes a hub. 

6. Apparatus as in claim 1, wherein said hub or switching device 
includes a switch. 

30 
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7. Apparatus as in claim 1, wherein said hub or switching device is 
disposed for receiving messages independently from an individual one of said storage 
devices, and is disposed for forwarding said messages independently to said point 
external to said housing. 



15 
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FIGURE 1 
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FIGURE 2 
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The requesting device 160 is ready to 
send a request to the mass storage 
system 100- 
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The requesting device 160 prepares the 
request, including one or more request 
messages 22 1 . 
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312 

The communication medium 150 
communicates the request messages 221 
from the requesting device 160 to the 
mass storage system 100. 
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The communication medium 
communicates the request messages 
from the requesting device 1 60 to the 
mass storage system 100. 
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The mass storage system 100 receives 
the request message 221 at the hub or 
switching device 120. 
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The hub or switching device 1 20 
determines a target storage device 1 10 
for the request message 221 . 
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The hub or switching device 120 sends 
the request messages 221 to the target 
storage device HQ. 
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The target storage device 1 10 receives 
the request messages 221. 
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The target storage device 110 processes 
the request messages 221 and formulates 
a response thereto, 
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The target storage device 110 sends the 
response messages 223 to the hub or 
switching device 120. 



320 

The hub or switching device 120 
receives the response messages 223 from 
the target storage device 1 1 0 and 
forwards them to the requesting device 
160, making the original request, using 
the communication medium 150. 
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